Data-driven phonetic comparison and conversion between south african, british and american English pronunciations

نویسندگان

  • Linsen Loots
  • Thomas Niesler
چکیده

We analyse pronunciations in American, British and South African English pronunciation dictionaries. Three analyses are perfomed. First the accuracy is determined with which decision tree based grapheme-to-phoneme (G2P) conversion can be applied to each accent. It is found that there is little difference between the accents in this regard. Secondly, pronunciations are compared by performing pairwise alignments between the accents. Here we find that South African English pronunciation most closely matches British English. Finally, we apply decision trees to the conversion of pronunciations from one accent to another. We find that pronunciations of unknown words can be more accurately determined from a known pronunciation in a different accent than by means of G2P methods. This has important implications for the development of pronunciation dictionaries in less-resourced varieties of English, and hence also for the development of ASR systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing direct G2P with G2P followed by accent conversion when determining pronunciations for South African English

It has been shown that techniques known as grapheme-and-phoneme-to-phoneme (GP2P) conversion can be used to derive pronunciations in a poorly-resourced accent, such as South African English, using available pronunciations in better-resourced accents of the same language, such as British and American English. However if the pronunciation is not available in either accent, it must be obtained usi...

متن کامل

The generation of regional pronunciations of English for speech synthesis

Welsh and Northern English), and two American ones (New York and South Carolina, to represent Eastern and Southern American); regional features were based primarily on the descriptions in [1], with native-speaker input where possible. The regional accents are abbreviated in this paper as: Br(Sc) = Edinburgh; Br(W) = Cardiff; Br(N) = Leeds; Am(E) = New York; and Am(S) = South Carolina. For the s...

متن کامل

The Generation of Regional Pronunciations of English for Speech Synthesis1

Welsh and Northern English), and two American ones (New York and South Carolina, to represent Eastern and Southern American); regional features were based primarily on the descriptions in [1], with native-speaker input where possible. The regional accents are abbreviated in this paper as: Br(Sc) = Edinburgh; Br(W) = Cardiff; Br(N) = Leeds; Am(E) = New York; and Am(S) = South Carolina. For the s...

متن کامل

Study Existing Various Phonetic Algorithms and Designing and Development of a working model for the New Developed Algorithm and Comparison by implementing it with Existing Algorithm(s)

A phonetic algorithm is an algorithm to identify words with similar pronounce and is used to index the words based on their pronunciation. Most of the algorithms are designed to work with English language. These algorithms are complex by nature due to many rules and exceptions in English pronunciation and change in evolving English language with adoption of many words from other languages. Also...

متن کامل

Effects of non-native dialects on spoken word recognition

The present study examined the premise that lexical information (top-down factors) interacts with phonetic detail (bottom-up, episodic traces) by assessing the impact of dialect variation and word frequency on spoken word recognition. Words were either spoken in the listeners’ native dialect (Australian English: AU), or in one of two non-native English dialects differing in phonetic similarity ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009